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On  An  Optimal  Stopping  Problem  of  Gusein-Zade 

By 

A.  Q.  Frank  and  S.M.  Samuels 
Purdue  University 

1.  Introduction. 

Gusein-Zade  [6]  studied  the  following  problem: 

A known  number,  n,  of  rankable  individuals  (rank  1 = best,  etc. ) 
are  to  arrive  in  random  order  (each  of  the  nf  possible  arrival 
orderings  is  equally  likely).  The  object  is  to  select  one  of  the  r 
best  individuals  (r  is  prescribed)  using  a stopping  rule  — so 
recalling  a previous  arrival  is  not  allowed  — which  is  based  only 
on  the  sequence  of  relative  ranks  — so,  in  effect,  all  that  can  be 
observed  about  each  arrival  is  how  many  of  its  predecessors  are  better 
than  it.  For  each  such  stopping  rule,  its  risk  is  the  probability 
that  it  does  not  select  one  of  the  r best  individuals,  and,  ’of  course, 
the  optimal  rule  is  the  one  with  the  smallest  risk. 

He,  in  effect,  derived  an  algorithm  for  computing  the  optimal  rule 
and  risk  for  each  r and  n (see  Section  2),  and  obtained  some 
asymptotic  results,  notably  that  the  optimal  risk  goes  to  zero  as  n 
and  r become  infinite. 

We  decided  to  use  the  general  asymptotic  results  of  Mucci  [7] 
to  see  what  happens  to  the  optimal  rules  and  risks,  for  various  values 
of  r as  n 00.  (For  r = 1,  the  risk  tends  to  1-e”^"  = .6321,  as 
is  widely  known;  for  r = 2 the  limiting  risk  is  . 4264  as  Gusein-Zade, 
and  also  Gilbert  and  Mosteller  [5]  , showed.  With  the  help  of  a computer, 
described  in  Section  3 ^ we  obtained  these  limiting  rules  and  risks  for 
r < 25  (see  Tables  1 and  2). 


as 
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The  most  suprising  feature  of  our  output  was  how  small  the  risks 


are.  Gusein-Zade' s argument  showed  only  that  the  limiting  (as  n oo) 
risk  goes  to  zero  as  r -*■  oo  at  least  as  fast  as  r ^ In  r.  But  our 
computations  strongly  suggested  that  it  goes  to  zero  exponentially  fast. 

We  say  this  because  we  knew  that,  for  each  r,  the  asymptotic  risk  is 
(l-t-^r))  where  t^(r)  is  the  limiting  (as  n •>  oo)  optimal  proportion 
of  individuals  to  let  go  by  before  being  willing  to  stop;  so  an  exponential- 
rate  of  convergence  of  the  risk  is  equivalent  to  t^(r)  being  bounded 
away  from  zero  and  one.  Indeed,  from  Table  1,  it  appears  that  t^r) 
is  tending  to  a limit  somewhere  near  .3. 

Inspired  by  these  computations,  and  aided  by  a model  which  is, 
in  effect,  the  "n  = oo"  case  (see  Section  4),  we  succeeded  not  only  in 
proving  that  the  rate  of  convergence  is  exponential,  but  also  in 
showing  the  existence  of  lim  t^(r)  = t and  in  evaluating  this  limit 
as  well  as  the  entire  asymptotic  (as  n ■*  oo,  r oo)  form  of  the  optimal 
rule.  These  results  are  in  Section  5*  with  proofs  in  Section  6. 

An  extraordinary  corollary  to  the  existence  of  t*  > 0 is  that, 

for  large  n and  r,  the  optimal  stopping  rule,  say  n , is  nearly 

"constant",  in  the  sense  that  the  proportion  of  individuals  seen  before 

actually  stopping  is,  with,  high  probability,  negligibly  greater  than  the 

proportion  seen  before  one  is  even  willing  to  stop.  To  be  precise, 

t /n  -*  t*  in  probability,  as  r and  n go  to  infinity  in  an  appropriate 
r,n 

way  (see  the  end  of  Section  5)* 
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2. 


Preliminari es . 


In  this  section  we  give  an  algorithm  for  computing  the  optimal 
rules  and  risks  in  Gusein-Zade' s problem  for  each  n and  r.  We 
obtain  it  as  a straightforward  application  of  the  method  of  backward 
induction,  as  described  in  Chow,  Robbins  and  Siegmund  [2],  which 
is  slightly  different  from  Gusein-*  Zade's  approach,  via  a Markov  chain. 

We  shall  use  the  following  notation: 


P 

n,r 


optimal  probability  of  selecting  one  of  the  r 
best  of  n arrivals; 


Q = 1-P  = optimal  (minimal)  risk; 

n,r  n,r  r x ’ 


Q,  Ji) 


n,r 


optimal  risk  among  all  rules  which  do  not  stop 
until  more  than  i individuals  have  arrived; 


H(k~l;r,i,n)  = conditional  probability  that  the  actual  rank  of 
the  i-th  arrival  is  greater  than  r,  given  that 
its  relative  rank  at  the  time  of  its  arrival  is 

k. 


Then  H(.;r,i,n)  is  just  the  cumulative  hypergeometric  distribution 
function 


(2.1) 


H(k;r,i,n) 


k 


E 

i=o 


<5 


and,  because  the  successive  relative  ranks  are  independent  random 
variables,  the  i-th  being  uniform  on  (l,2,...,i),  we  are  led  to  the 
algorithm: 


(2.2) 


Q (i-l)  = =■ 
n,  r l 


£min{Qn  p(i),  H(k-l;r,i,n)}  i = n,n-l,...,l 


k=l 
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with  the  boundary  condition 


(2.3)  Q^r(n)  = 1 . 

Implicit  in  the  algorithm  is  the  fact  that  the  optimal  stopping 
rule  stops  at  the  first  i (if  any)  for  which  the  relative  rank  of 
the  current  arrival  (say  k)  is  small  enough  so  that 


(2.4)  H(k-l;r,i,n)  < 0^  (1)  . 

The  left  side  of  (2.4)  is  decreasing  in  i while  the  right  side 
is  increasing,  so,  as  one  would  expect,  the  later  the  arrival,  the  less 
stringent  is  our  standard  for  selecting  it.  It  is  convenient  to 
designate  those  times  after  which  the  selecting  standard  is  successively 
relaxed.  Let  m.(n,r),  for  1 < j < r be  the  integer  satisfying 


(2.5) 


Qn,r^mj  (n>r)+1)  > H(j-l;r,nu  (n,r)+l,n)  . 


Then  the  m.'s  are  increasing  in  j and  the  optimal  rule  may  be 
described  as:  "Let  m^(n,r)  arrivals  go  by;  then  stop  at  the  first 
i > m (n,r),  if  any,  where  Z.  is  the  relative  rank  of  the  i-th 

La  • 1 

1 

arrival. 

Notice  that  Q,  = Q (0)  = Q (l)  = ...  = Q (m  (n,r)), 
n, r n,r  n,rv  / n,r  lv 

and,  from  (2.5), 
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(2.6) 


r-1  m (n,r)+l  r-1  m (n,r) 

TT  f1  - — > < \ r < TT  (1  - -^T- ) 

0=0  n 3 0=0  n 3 


Now  we  re-write  the  algorithm  (2.2)  as 


(2.7) 


o (i ) - Q (i-1)  t [«n)r(i)-H(i-l;r,l,n)]- 

J2  p v ' n y x ' lr— I ' 


1 

n 


n^r 


k=l 


l 

n 


and  note  that,  if  i is  allowed  to  vary  with  n,  then 


i/n ->  t H(k-ljr,i,n)  -»B(k-l;r,t) 


where  B(*;r,t)  is  the  cumulative  binomial  distribution  function 


(2.8) 


B(kjr,t) 


| (*)  t3(l-t)r'3  . 

d=0  3 


This  suggests  that,  if  we  let  i vary  with  n 

n ->  oo,  then  Q (i)  should  tend  to  a limit, 
3 n,r 

differential  equation 


so  that  i/n  t*  t as 
Q^(t),  satisfying  the 


1 « + - 

(2.9)  QL(t)=±  £ [Qr(t)  -B(k-l;r,t)]  0<t<l 

k=l 

with  the  boundary  condition  Q^(l  ) = 1.  Mucci  (1973)  has  shown  that 
this  is  true,  for  Gusein-Zade' s as  well  as  many  other  risk  functions. 
It  follows  that,  if  we  let  t.(r)  be  the  "time"  satisfying 


(2.10) 


Qr(tj  (r))  = B(o-l;r,tj(r)) 


for  o = 1,2,..., r,  then 
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(r,n)/n  ■>  (r) 

Q (•  ) is  constant  on  (0,t^(r)],  and 

(2-11)  *0*  = Qr(tx(r)) 

= (l-t1(r))r 

(in  fact,  the  convergence  is  monotone:  r is  an  increasing  function 

of  n,  as  Mucci  showed). 

Thus,  for  large  n,  the  optimal  rule  lets  approximately  t (r)*n 

arrivals  go  by;  then  stops  at  approximately  the  first  i such  that 

i/n  > t (r),  if  any,  where  Z.  is  the  relative  rank  of  the  i-th 

A.  1 

1 

arrival. 

Mien  r = 1,  all  of  this  is  elementary  and  widely  known, 
including  the  fact  that,  as  n oo, 

Pn  1 4 1-Q1=  6 = tj (1)  = lim  m^ (n,l)/n  . 

Results  for  r = 2 were  given  by  Gilbert  and  Mosteller  [5] 
(section  2d)  as  well  as  by  Gusein-Zade.  These  include 

m2 (n,2 )/n  2/3  ; 

m1(n,2)/n  ->cp  « .3^70 

where  <p  - in  <p  = 1-in  2/3;  and  hence 

P i 1-Qp=  l-(l-9)2  * -5736  . 
n,  <- 
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3 . Solving  the  Differential  Equation. 


On  each  interval  [t . (r), t . ..  (r)],  (2.9)  becomes 

J J J- 


Qv(r)  = t-1[jQ  (t)  - Yi  B(k;r,t)]  ; 


J-l 


dt  > 


k=0 


the  solution  is 


(3.1)  Qr(t)  = t^{c  - [ u“^+l)  Y B(k;r,u)du] 

J JO  k=0 


Since  the  t.'s  and  c.'s  are  not  known  in  advance,  we  must  derive 
J J 

them,  one-by-one,  working  backwards. 

We  use  the  boundary  condition:  0^(1")  = 1 and  the  fact  that 
Y ^”q  B(k;r,u)  = r(l-t)  to  get  c^  = r/(r-l),  and  hence,  for  r > 1, 


(3.2) 


Qr(t)  = l-r(t-tr)/ (r-1)  , tr(r)  < t < 1 . 


The  boundary  condition  (2.10),  with  j = r,  becomes 


^(^(r))  = l-t£(r)  ; 


substituting  (3.2)  yields 

(3-3)  tr(r)  = [r/(2r-l)]1//<-r"1^ 


which  was  derived  by  Gusein-Zade. 
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The  right  side  of  (3*2)  is  also  useful  for  t < tp(r)  because, 
for  any  t e (0,l),  it  is  the  limiting  risk  of  the  rules: 

"let  [nt]  arrivals  go  by,  then  stop  with  the  first  arrival,  if  any, 
of  relative  rank  < r".  If  we  let  t = 6(r)  with  0(r)  1 but 

I' 

0(r)  ->0,  then  the  right  side  of  (3.2)  goes  to  zero  as  r ■><». 

""1  / X* 

Gusein-Zade  used  these  very  rules,  with  6( r)  = r ' , • to  establish 

that  the  risks  go  to  zero  as  n and  r become  infinite. 

Returning  to  the  differential  equation?  for  j = r-l,r-2, . . . ,1, 

once  we  know  t.,n  (r)  we  can  solve  for  c.  in  (3.1)  --  taking 
J+1  0 

t = t.+^(r)  and  using  (2.10)  with  j replaced  by  j+lj  then  we  can 
solve  for  t.(r),  which,  by  (2.10),  is  the  root  of  the  equation 
{^(t)  - B(j-l;r,t)  = 0 with  Q^(t)  given  by  (3*l).  When  we  reach 

X* 

t^ (r ) we  are  finished,  because,  by  (2.1l),  (l-t^(r))  is  the 

limiting  risk,  Q^. 

These  computations  have  been  carried  out  for  r < 25  J some  results 
are  presented  in  Tables  1 and  2.  (The  same  numerical  results,  for  r < 10, 
were  obtained  by  Rasmussen  [8],  with  the  sole  exception  that  his  t-^(lO) 
is  .3128  while  ours  is  .3129.  He,  in  effect,  derived  our  formula  (3*1) 
in  its  expanded  form  directly  from  the  finite  n problem,  without  benefit 
of  Mucci’s  differential  equation.) 

As  we  remarked  in  the  introduction,  the  distinctive  feature  of 
Table  1- is  the  apparent  convergence  of  t^(r)  to  a non-zero  limit, 
which  would  imply  exponential  convergence  of  the  limiting  risks, 

Q(r),  to  zero.  This  will  be  confirmed  in  Section  5*  (We  were, 
however,  unable  to  prove  that  t^(r)  is  monotone  decreasing.) 
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TABLE  1 


Limiting  Minimal  Risk  : and  Optimal  Proportion  of  Arrivals 

to  Let  Go  By  Before  Considering  Stopping:  t^(r ) . 


r 

Q 

r 

\(r) 

r 

Q 

r 

t2(r) 

1 

.6321 

• 3679 

14 

.0058 

.3078 

2 

.4264 

.3470 

15 

.0041 

.3068 

3 

.2918 

.3367 

16 

.0029 

.3060 

4 

.2013 

• 3302 

17 

.0021 

.3052 

5 

.1397 

• 3255 

18 

.0015 

.3044 

6 

.0973 

.3219 

19 

.0010 

.3038 

7 

.0679 

.3190 

20 

.0007 

.3031 

8 

.0476 

.3166 

21 

.0005 

.3026 

9 

.0334 

.3146 

22 

.0004 

.3020 

10 

.0235 

.3129 

23 

.0003 

.3015 

11 

.0165 

• 3113 

24 

.0002 

.3011 

12 

.0116 

.3100 

25 

.0001 

.3008 

13 

.0082 

.3088 

00 

0 

.2834 

Note:  = [l-t^(r)]r 
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Asymptotic  Form  of  the  Optimal  Stopping  Rules: 

Select  Arrival  of  Relative  Rank  j Only  After  t.(r)-n  Previous  Arrivals 


ON  t—  ChH  OJ-3--3-  ON  O OJ 
C\JVOOO  LfN  H O -S' J-  KN  H 
H KN  OJ  O NKNCO  KNCO  (Q, 
KN-4-  LTN  VO  VO  t—  f—  CO  00  ON 


CO  -V  LTNVO  ON  VO  H KNCO  H f-CO  ONVO  O 
NOfCiNDt-QOOOJW  OMJNCO  O H C\I4- 
O O t—  KN  ON  KNCO  OJ  LTN  ON  OJ  VO  ONOJ  LTN 
KN-4-  -4-  LTN  LTNVO  VO  c—  t—  t—  CO  CO  co  onon 


•ral 


H OJ  KN-4-  LTNVO  C^CO  On  O 


H OJ  KN-4-  LTNVO  t— CO  ONO  H OJ  KN -V  LTN 

i — 1 i — if — I i — if — If — ! 


Hi 


o 

I — I 


O ovo  OJ  O KN  O 
ON  OJ  -V  C — CO  H OJ 

& 


>OJ  -V  t^oo  H 
H D — CO  D — LTN  KN 
KN-V  LTNVO  C-OO 


VO  H LTNVO  KN  LTN  O H 
VOCOOJCOKNQKN-V 
H LTNVO  -V  OJ  ON  LTN  H 
KN-V  LTNVO  t—  t— CO  ON 


VO  LTN  KN  O O 
-4-  VO  -4-  LTN  LTN  t — LTN 
H-V-VOJ  ON  LTN  t — [ VO 
KN-4"  LTNVO  VO  C^CO  CO 


•n| 


H OJ  KN -V  LTNVO  C—  H OJ  KN-4-  LTNVO  f- CO  H OJ  KN-4-  LTNVO  C— CO  ON 


HI 


IN- 


CO 


ON 


ON 

O IN- 

IN-CO  VO 

OJ  CO  H CO 

u 

IN- VO 

VO  VO  -4" 

OHNON 

' — 

VO 

-4  VO 

KNOO  IN- 

KN-4-  On  OJ 

•i“D 

1 ^ 

KN 

KN  VO 

KN  LTN  D— 

KN  LTNVO  CO 

LTNVO  IN-  t—  KN 
LTN  H f-  O KN 
OJ  H -4-  VO  VQ 
KN  LTNVO  t—  CO 


ONkn  O OJ  H CO 
H ON  OJ  KN  OJ  LTN 
OJ  CO  H H Q CO 
KN-4-  VO  C — CO 


•o| 


H OJ  H OJ  KN  rl  OJ  KN  -4  H OJ  KN  -4"  LTN  H OJ  KN  -4  LTNVO 


Hi 


OJ 


KN 


VO 


10 


9236 


TABLE  2 Continued 


oo  oj  oj  o\  oj  vo 
0 0-3“  o\  o vo 
O NOIVO  H J 
(O,  (O,  -3“  -3-  LTV  LTV 


0\00  t-  o 

~ CT\  t— 


_ M OJ  M H M LTV  VO  H O LTV  VO  3 OCO  J 

OVOUNf-QJ  MO  10  3 LTV  LTV  LT\-3"  OJ  O 00  VO  IO,  OJ 
M H (AVO  ONH3VOCOOai3VOOOOHlOlAt- 
- - - — ~~~o\o\o\ova\ 


LTV  VO  VO  VO  VO  c-— £— c-— mcococoooco 


•ol  H OJ  10,-3-  lev  VO  MB  0\0  H W IA3  Lf\VO  t-CO  B\0  H C\l  A3  lA 

< — ( c — l i — I r— I rM  f — I i — I i — lr — Ii — ! OJ  OJ  OJ  OJ  OJ  OJ 


Ml  OJ 


HV03  A®'  CT\-3-  HirvOMMHlOLTMMOJiHMi-l  LTV 
AAANOIAHVOOOoOt-AHVO  O -3"  M OVOJ  LTV 
0*00-3-  O\_3-C0  OJ  LTVOO  H3  t — Q OJ  LTV  m Ov  H3  VO 
rC\  (0-3-  -3"  UA  IOVO  VO  VO  t— MMUJCOCOCOCO  OvOv  OV 


-ol 


H OJ  A 


_3  ITVVO  C — CO  ON  O rl  OJ  (0,-3-  LfAVO  £ — CO  ONO 
i — If — I i — I r it — [i — It — If — ! i — It — I Ol 


Ml 


O 

CV1 
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Looking  at  Table  2,  we  see  that,  for  each  j,  t.(r)  is  decreasing 
with  r,  but  it  is  not  clear  — from  the  table  — what  the  limits  are. 

It  hardly  seems  plausible  that  for  each  j = 2,3,. •• 


(3.4)  t . (r ) - t,  (r ) 0 as  r -$»  oo  . 

Yet  this  is  indeed  the  ease;  and,  as  we  shall  see,  it  implies  the 
extraordinary  "almost  constant"  property,  of  the  optimal  stopping 
rule  when  n and  r are  large,  which  was  mentioned  in  the  intro- 
duction. 

Although  surprising,  (3.4)  is  not  hard  to  prove.  Recall  that 

t.(r)  is  defined  by  (2.10)  and  (2.8).  Because  Q (t)  is  increasing 
J r 

in  t while  B(j-l;r,t)  is  decreasing  in  t,  an  upper  bound  for 

t.(r)  is  any  9 such  that 
J 

B(j~ljr,0)  ^^(t-L  r))  = (l-t1(r))r  . 


Hence  it  is  sufficient  to  show  that,  for  each  j > 2,  and  for  any 
5 > 0,  (and  ignoring  r's  for  which  t^(r)  > 1-2&) 


lim  sup(l-t1(r  ))_r  BCj-ljr,^  (r  )+5  ) < 1 , 


which  is  true  because,  letting  0^  = t^(r)+5. 


j -1  , 9 k i 1-0  r 

O — 1/P  - 


(l-tl(r))-  B(j-l;r,er)  - V & (j^-)  ^ ( r 

P _L 


< 


Aj  rJ[1  - l-tpr)^  * 0 M r -»•  ~ • H 
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4.  The  Infinite  n Problem. 


A handy  tool  for  deriving  asymptotic  (r  -3»co)  results  is  the 
"infinite  n model"  as  presented  in  Gianini  and  Samuels  [4] 
and  in  Gianini  [3]« 

Let  an  infinite  sequence  of  rankable  individuals  arrive  at  times 
{Y^  = arrival  time  of  i-th  best)  which  are  IID,  each  uniformly 
distributed  on  (0,1 ).  We  want  to  consider  only  stopping  rules  which 
are  "based  only  on  relative  ranks;"  this  we  achieve,  formally,  by 
letting 


Y^(t)  = arrival  time  of  i-th  best  among 
those  which  arrive  by  t 

3(t)  = 3(Y1(t ),  Y2(t),...  ) , 

and  allowing  only  stopping  rules  which  are  adapted  to  the  g(t)'s, 

and  either  don't  stop  (i.e.,  defective  rules  are  allowed)  or  stop  at 

one  of  the  YJs  For  any  such  rule,  its  "r-risk"  is  the  probability 

that  it  fails  to  stop  at  one  of  the  times  Y^, ...,Y  • For  each  r, 

we  wish  to  minimize  the  r-risk  and  to  find  a rule  which  does  so. 

This  problem  has  been  shown  to  be  "the  limit"  of  Gusein-Zade' s. 

Specifically,  Q^(t),  the  solution  to  (2.9)  with  the  boundary  condition 

Q (1  ) = 1,  is  the  minimal  r-risk  among  all  stopping  rules  which  do  not 

stop  before  time  t.  Hence,  the  optimal  rule  waits  until  time  t^(r), 

then  stops  at  the  first  arrival  time  cr  > t_  (r),  where  Z is  the 

Z • cr 

cr 

relative  rank  of  the  arrival  at  time  cr,  and  the  t.(r)'s  are  as 

J 

defined  in  (2.10).  Therefore  any  asymptotic  result  we  obtain  for  the 
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infinite  n problems  as  r ->■  co , is  also  an  asymptotic  result  for 
Gusein-Zade' s problems,  as  first  n co  , then  r->  oo  . 

5.  Asymptotic  Results  for  the  Infinite  n Problem. 

First  we  shall  establish  that  the  optimal  r-risks,  Q^,  go  to 
zero  exponentially  fast.  This  will  be  in  two  parts: 

(5-1)  Qr  > 2"r  for  all  r=l,2,... 

and 

(5-2)  lim  sup  ^(t)1^  < infQ<a<t  max(ta,  (|)  (i^-)1  ) , 

for  all  t e (0,1 ). 

The  first  part  shows  that  the  rate  is  at  most  exponential;  the  second 
that  it  is  at  least  exponential. 

Because  = (l-t1(r))r,  (5.1)  and  (5-2)  imply 

(5-3)  1 " inf0  <a  < t < 1 max^t  , (5:)  ^ 

< lim  inf  t^(r)  < lim  sup  t^(r)  < . 

The  expression  in  the  curly  brackets  in  (5.2)  and  (5. 3)  will  be 
shown  to  be  the  limit  of  the  r-th  roots  of  the  r-risks  of  the  rules: 
"wait  until  time  t,  then  stop  with  the  first  arrival  of  relative 
rank  < Or,",  which  are  of  some  interest  in  themselves  — especially 


to  see  how  they  compare  with  the  optimal  rules.  So  we  have  computed 


some  values  which  are  presented  in  Table  3-  For  each  listed  ct -value, 
we  have  used  the  optimal  t -value  (the  t which  makes  the  two  terms 
in  the  curly  bracket  equal).  It  can  be  seen  that  the  optimal  (t,Q!) 
pair  is  t ~ .591  > a **  *309  and  the  left  side  of  (5*3)  is  about 

1 - .85  = .15. 

Once  the  exponential  rate  of  convergence  is  established  it  will 

be  possible  to  give  the  complete  asymptotic  form  of  the  optimal  rule 

l/r 

and  to  derive  the  limit  of  Q,  . To  describe  the  results  we  need  to 

r 

define  the  functions 


(5.1)-)  “(1)  = d/r  if  t = t . (r ) 

r 

= 0 if  t = 0 

= 1 if  t = 1 

linear  in  between; 

so  the  optimal  rule  for  getting  one  of  the  r best  can  be  stated 
as:  stop  at  the  first  arrival  time  t at  which  the  relative  rank 

of  the  current  arrival  is  < ct  (t)>r  . What  we  shall,  show  is  that 

— r 

l/r 

t-^(r),  ar(t),  and  Q^(t)  ' have  limits  related  in  this  way: 


(5.5) 


t-L  (r ) ->t* 


(5.6) 


«r(t)  *a(t) 


(5-7) 


r 


Qr(t) 


l/r 


(1-t*) 


f 

| / t ^(t),  i-t  q-a(t) 
(aTtJj  u-a(t) 


if 

t 

< t* 

if 

t 

> t* 
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TABLE  3 


Limit  of  r-th  Roots  of  r-Risks,  of  the  Rules: 
"Select  First  Arrival  After  Time  t(o:)  with 
Relative  Rank  < af" 


t(a) 

a 

lim  Q1//r 
r 

.3032 

.10 

.3875 

.4650 

.20 

.8580 

.5822 

•30 

.8502 

.5903 

.308 

.8501396 

.5913 

.309  ■ 

.8501389 

• 5923 

.310 

.8501394 

.6743 

.4o 

.8541 

.7500 

• 50 

.8660 

Note:  t(a)  = i-(i^)aa/^)  . 


1 6 


where 


(5-8) 


a(t)  = o 


for  t < t* 


^ from  0 to  1 


as  t f from  t*  to  1 


and,  on  (t*,l),  Qt(*  ) is  a solution  to  the  differential  equation 


(5-9) 


(t) 


[l-os(t)]/(l-t) 
lnlt[l-o!  (t ) J / (l-t  )a  (t)J  ‘ 


This  enables  t*  to  be  numerically  evaluated.  We  found  that  t*  ~ .2834. 

We  have  also  evaluated  a(t)  for  various  values  of  t > t (see 
Table  1)-).  It  is  true  enough  that  these  values  help  to  characterize  the 
asymptotic  form  of  the  optimal  rules.  But  there  is  less  here  than  meets 
the  eye,  for  the  fact  is  that,  for  any  t > t*,  and  for  large  r,  the 
optimal  rule  for  getting  one  of  the  r best  — call  it  t — will 
already  have  stopped  by  time  t,  with  high  probability.  In  other  words, 
as  we  shall  show: 


(5*10)  ■»  t*  in  probability  as  r 00  . 

This  implies,  that  for  each  e and  5 > 0 there  is  an  r(e,5),  and, 
for  each  r > r(e,5),  an  n(r,e,g),  such  that 

n > n(r,e,5)  =»P(|t  /n-t*|  > 5)  < e 

1 > 11 

where  t is  the  optimal  rule  for  selecting  one  of  the  r best  of 
r,  n 

n arrivals.  (This  clarifies  the  remark  at  the  end  of  the  introduction. ) 
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TABLE  4 


Asymptotic  Form  of  the  Optimal  Rules  For  the  Infinite-n  Problem, 
as  r Becomes  Infinite:  "Stop  at  Arrival  Time  t > t*  ~ .2834 
Only  if  Relative  Rank  < a(t)*r" 


t 

a(t) 

t 

'a(t) 

.2835 

.00001 

.65 

.300 

.2840 

.0001 

.70 

.373 

.2850 

.0003 

.75 

.454 

.29 

.001 

.80 

.543 

•30 

.004 

.85 

.642 

.35 

.024 

.90 

.751 

.40 

.053 

.95 

.870 

.45 

.088 

.96 

.895 

.50 

.130 

.97 

.921 

• 55 

.179 

. 98 

.947 

.60 

.236 

.99 

• 973 
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6.  Proofs  of  Results  in  Section  $. 


Proof  of  The  first  step  is  to  show  that,  for  each  r > 1, 

(6.i)  Qr  > t1(r)Qr_1  • 

To  see  this,  look  at  the  r-risk  only  on  the  event  that  the  overall  best 

individual  arrives  before  time  t^(r),  an  event  of  probability  t^(r). 

Since  the  optimal  rule  does  not  stop  before  time  t^(r),  its  conditional 

risk  on  this  event  is  its  probability  of  failing  to  select  one  of  the 

r-1  best  of  the  remaining  individuals  (other  than  the  overall  best); 

and,  most  important,  on  this  event  the  optimal  rule  is  based  only  on 

the  relative  ranks  of  the  reamining  individuals;  it  is:  "Wait  until 

t2(r),  then  stop  at  the  first  arrival  time  a > +1(r),  where 

cr 

is  the  relative  rank  of  the  arrival  at  time  cr,  with  respect  to  all 

previous  arrivals  other  than  the  overall  best."  Wow  the  arrival  times 

of  all  individuals  other  than  the  overall  best  are  themselves  IID, 

uniform  on  (0,1 ) and  independent  of  the  arrival  time  of  the  overall 

best;  hence  the  conditional  r-risk  is  at  least  ^(t^(r)),  which,  in 

turn,  is  at  least  Q . 

’ r-1 

Wow  suppoae  that,  for  some  r,  t^(r)  > l/2.  Then  > [l-t^(r)]Q  , 

but,  since,  for  each  k,  (l-t^(k))  = 

t^ (r-l)  > t^r).  Hence  t^ (r-1 ) > l/2.  But  this  leads  to  a contradiction, 
because  t^(l)  = e "*"  < l/2.  Therefore  t^(r)  < l/2  for  all  r,  which  is 
equivalent  to  (5*1). 


Qk,  this  is  equivalent  to 
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We  shall  need  the  following: 


(6.2)  Q(rjt)  > tr  for  all  t e (0,l) 

> (l-t)r  for  all  t e(t1(r),l)  • 

The  first  inequality  holds  because  t is  the  probability  of  the  event: 
"all  of  the  r best  arrive  before  time  t",  in  which  case  any  rule 
which  doesn' t stop  before  time  t cannot  possibly  select  one  of  the 
r best.  The  second  inequality  holds  because,  when  t > t^(r),  we  have 

Q(r;t)  > Qtyjt-^r))  = (l-t^r))1  > (l-t)r  . 

We  shall  also  need  to  use  the  following  fact  about  binomial 
probabilities: 


(6.3) 


lim 


r ->  oo 


(([arr])ttar](i-tf-[ar))l/r 


4~  a 
- & 


l +-  1*^ 

(iii) 

vl-a' 


= lim 


r -$> 


B([ar];r,t)' 


l/i 


if  0 < a < l 

if  o < a < t . 


The  first  equality  follows  immediately  from  Stirling's  formula;  the 
second  follows  from  the  first  and  the  inequality. 


k+1  l-t  _a_  i-t 

r-k  t 1-a  t 


k < [ar] 
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which  implies  that 


(6.4)  B([Or]jr,t)  < (1—  i^)'1((arr])t[arl(l-t)r-[“rl 
whenever  a < t. 

Proof  of  (5«2):  Fix  t and.  let  X be  the  actual  rank  of  the 
[ar]-th  best  of  those  which  arrive  by  time  t,  and  Q^(t,a)  be  the 
r-risk  of  the  rule:  "accept  the  first  arrival  after  time  t,  with 
relative  rank  < ar,  if  any".  If  [ar]  < X < r+l,  then  this  rule  will 
surely  select  one  of  the  r best;  while  if  X = x > r+l,  then  the 
conditional  probability  of  selecting  one  of  the  r best  can  be  shown 
to  be  r/(x-l)  which  is  at  most  r/(r+l).  Hence 

(6.5)  P(X=  [ar] ) +P(X  > r+l)  > Qp(t,a) 

> P(X=  [ar]  ) + (r+l)"1  p(X  > r+l)  . 

Wow 

P(X=  [ar] ) = t^ar^ 

and 

P(X  > r+l)  = B([ar]-1;  r+l,t)  ; 

so  when  we  take  the  r-th  root  of  all  sides  of  (6.5),  let  r -»oo, 

and  apply  (6.3),  we  find  that  both  the  upper  bound  and  the  lower 

l/r 

bound  for  Qi>(t,a)  ' converge  to  the  expression  in  the  curly  bracket 
in  (5-2).  Since  the  optimal  rules  must  do  at  least  as  well  as  these, 
(5.2)  must  hold.  || 
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We  note  for  future  reference  that,  for  fixed  t e (0,1 ), 


(6.6)  (^)  (~j)"'""Q'  t fro®  1-t  to  1 as  Gc  f from  0 to  t . 

Now  let  us  use  the  notation  (5*4)  to  re-write  the  differential 
equation  (2-9)  as 

[r«r(t)] 

(6.7)  Q7(t)  = t-1{r  a (t)Q.  (t)  - £ B(k-l;r,t)} 

r k=l 

and  note  that 

(6.8)  B([r  ar(t)]-l;r,t)  < Qr(t)  < B([r  ar(t)];r,t)  . 

Derivation  of  Re-writing  (6.7)  in  terms  of 

gr(t)  ^ Qr(t)l/r  , 

yields 

(6-9)  gj.(t)  = (rt)“1[r  ar(t)]gr(t)(l-hr(t)) 

where 

[r  ap(t)] 

(6.10)  hp(t)  = B(k-l:r,t)/[r  ar(t)]B([r  ar(t)]-l;r,t)  . 

It  can  be  shown,  using  (6.4)  and  the  unimodality  of  the  binomial  distri 
bution,  that 
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0 < P < a<t  =>  B([r6];r,t)/B([ra] jr,t)  >0  as 


r oo  . 


Hence  h (t)  -»0  as  r •»  oo  whenever  ct^(t)  ■XX(t)  e (0,t). 
Choose  any  weakly  convergent  subsequence, 

w 

6r  (•  ) -»  g(*  ) • 

i 

By  (6.2)  we  have,  necessarily. 


g(t)  > max  (t, 1-t)  on  (t*,l) 


where 


(6.11)  t*  = lim  t (r.^)  = inf{t:  g(t)  > g(0+)}  > 0 

and  g(t*)  = 1-t*.  Hence  by  (6.6)  we  can  represent  g(t)  in  the  form 


(6.12) 


g(t)  = 1-t* 


if  0 < t < t* 


, 1-t  %1~a(t) 

3L-a(t); 


if 


t*  < t < 1 


with 


a(t)  < t and  increasing  from  0 to  1 on  (t*,l). 

And,  from  (6.3),  (6.6),  and  (6.8),  it  follows  that,  necessarily. 


w 

«r  (* ) ->  <*(• ) . 
l 
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Hence,  from  (6.9), 


(t)  t_1  a(t)g(t)  a.e.(t)  ; 

i 

and  since  the  g^(*)'s  are  uniformly  bounded  (recall  that  g^(t)  =0 
on  (Ojt^r))  and  t^(r^)  t*  > 0),  we  can  apply  the  dominated 
convergence  theorem  to  conclude  that  g(.'» ) is  differentiable  with 

(6.13)  g'  (t)  = t"1  a(t)g(t)  on  (t*,l) 

= 0 on  (0,t*) 

(6.ih-)  g(D  = l • 

Since  this  is  true  for  every  weakly  convergent  subsequence,  and  since 
the  differential  equation  with  the  boundary  condition  uniquely  determines 
g(*  ),  including  the  value  t*,  we  conclude  that 

Qr(t)1//r  ->g(t)  as  r ->00  , 

where  g(*  ) is  implicitly  defined  by  (6.12),  (6.13),  and  (6.1^). 

Re-writing  (6.13)  in  terms  of  &(•),  using  (6.12),  yields 

(5-9).  II 

Proof  of  (3.10):  We  first  note  that,  for  any  j and  t > t.(r), 

tJ 

is  less  than  t on  the  event:  "at  least  one  of  the  j best  arri- 
vals  by  time  t has  arrived  after  time  t.(r)".  This  event  has 

tJ 

probability  l-[t.(r)/t]J  and  is  independent  of  3(t.(r)).  Hence,  for 
any  6 > 0, 

2k 


P(rr  > t1(r)  + &)  < infj>1{tj(r)/t1(r-t6)}D  . 

The  right  side  goes  to  zero  as  r oo,  by  (3 A),  and  t^(r)  ->t*, 
which  completes  the  proof.  || 

7*  Concluding  Remarks. 

A.  The  policy:  "Observe  only  the  first  ^(r^n)  applicants,  then 
choose  the  best  of  these"  has  probability  approximately  (l-t^(r))  of 
selecting  one  of  the  r best  of  all  n applicants.  The  optimal  stopping 
rule  based  only  on  relative  ranks  has  virtually  the  same  risk  as  this 
policy  for  all  n and  r,  and,  for  large  n and  r,  stops  nearly  as 
soon  as  this  policy  does  — if  we  are  willing  to  ignore  "times"  of 
smaller  order  than  n — according  to  (5. 10). 

B.  The  limiting  optimal  proportion  of  individuals  to  let  go  by 

before  being  willing  to  stop  (as  n ->oo)  is  nearly .37  in  the  classical 
best  choice  problem  (r  = l),  and,  as  we  have  shown,  tends  to  about 
.28  as  r ->oo.  In  the  so-called  "rank  problem",  where  the  object 
is  to  minimize  the  expected  rank  of  the  individual  chosen,  the  limiting 
optimal  proportion  to  let  go  by  is  | L ...  { j/(j+2))lj^^+1^  which 

1 <•  J < CO 

is  slightly  less  than  .26,  as  shown  in  Chow,  Moriguti,  Robbins,  and 

Samuels  [1]  . And  in  a problem  where  the  n observations  are  not 

relative  ranks,  but  are  IID,  uniformly  distributed  on  an  unknown  interval, 

and  the  object  is  to  minimize  the  expected  quantile  of  the  observation 

, _ -1//2 

chosen,  the  minimax  rule,  as  n ->co,  lets  approximately  (3  + 2y2  ) • n, 

or  about  29$  of  the  observations,  go  by  before  being  willing  to  stop, 
as  shown  in  Samuels  [10]. 


25 


C.  The  completely  random  order  of  arrivals  is  crucial.  If  the 


actual  ranks  of  the  successive  arrivals  were  cyclical: 


Z , Z-l,...,2,  1,  n,  n-l,...,Z+l  , 


with  Z equally  likely  to  be  1,2,...,  or  n,  then  no  stopping 
rule  based  on  relative  ranks  has  probability  greater  than  r/n  of 
selecting  one  of  the  r best.  And  since  the  randomized  rule  ”T  = k 
with  probability  l/n,  for  k = l,2,...,n,  regardless  of  the  data" 
has  this  success  probability  for  every  arrival  ordering,  the  cyclical 
ordering  above  may  be  called  an  optimal  counter  strategy.  For  another 
optimal  counter  strategy,  see  Samuels  [9]. 
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We  study  the  problem  of  selecting  one  of  the  r best  of  n 
rankable  individuals  arriving  in  random  order,  in  which  selection 
must  be  made  with  a stopping  rule  based  only  on  the  relative  ranks 
of  the  successive  arrivals.  For  each  r up  to  r = 25,  we  give 
the  limiting  (as  n -*  oo)  optimal  risk  (probability  of  not  selecting 
one  of  the  r best)  and  the  limiting  optimal  proportion  of  indivi- 
duals to  let  go  by  before  being  willing  to  stop.  (The  complete 
limiting  form  of  the  optimal  stopping  rule  is  presented  for  each  r 
up  to  r = 10,  and  for  r = 15,  20  and  25. ) We  show  that,  for  large 
n and  r,  the  optimal  risk  is  approximately  (l-t*)r  where  t*  « .283^ 
is  obtained  as  the  root  of  a function  which  is  the  solution  to  a certain 

differential  equation;  and  the  optimal  stopping  rule  t lets  approxi- 

r,  n 

* 

mately  t n arrivals  go  by  then  stops  "almost  immediately"  ifl  the  sense 
that  ^/n  -*•  t*  in  probability  as  n ■*  oo,  r -*  oo  . 
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